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Distributed Multicast Caching Technique 

BACKGROUND OF THE INVENTION 

1. Field of the Invention. 

5 This invention relates to multicast transmission of 

information across a data network. More particularly this 
invention relates to an improved caching technique for 
use in multicasting over a data network. 

2. Description of the Related Art. 

10 The push model for distributing data over the Inter- 

net and other client server networks has become more 
widespread in recent years. In modern versions of this 
model a server "multicasts" data to an interested subset 
of clients on the network, known as a "multicast group" . 

15 Whoever is interested becomes a listener by joining the 
group . 

By their nature, push applications are closer to the 
broadcasting paradigm of radio and television than to the 
interactive paradigm of the World Wide Web. As such, 

20 broadband networks, such as cable TV or satellite, can be 
used as a very efficient medium for the transmission of 
"pushed data". Unfortunately, currently these networks 
are one-way only. That is to say, data such as a televi- 
sion program is sent from a broadcasting facility (the 

25 head-end) to several receivers (end-users) without any 
feedback. As such, these networks are inappropriate for 
popular interactive push applications since the latter 
require a return channel. Although attempts to upgrade 
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the current public network infrastructure are underway in 
several places around the world, it will take some years 
until reliable two-way broadband networks are commonplace 
and therefore, a mechanism for multicasting over one-way 
5 broadband networks is desirable. 

Caching systems have been developed to improve the 
efficiency of data transmission across the internet. Cur- 
rent internet caching systems, however, are based on the 
unicast TCP/IP transport protocol. Various technical is- 

10 sues have impeded the development of caching systems 
adapted to multicast transmission. Using the unicast 
transport protocol, a document is required to be deliv- 
ered separately to each requesting client of the cache. 
For example, when two clients request the same documents 

15 simultaneously, or within a short interval, the cache 
transmitter is required to read the document twice, and 
send it twice. Consequently the resources of both the 
cache transmitter and the internet are used ineffi- 
ciently. 

2 0 The document, Reliable Multicast Transport Protocol , 

Shioshita, Teruji et al., Draft Document for the 37 th 
IETF, Feb. 7, 1997, proposes a transport control mecha- 
nism to enable reliable multicast data transfer to a 
large number of receivers on a TCP/IP network from a 

25 server in parallel. This protocol promotes short delivery 
time, as the data is transferred only once, and conserves 
bandwidth because only one copy of the data is sent to 
the server. It has the advantage of requiring only a sin- 
gle session regardless of the number of receivers. How- 
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ever, despite some optimizations, there remains a re- 
quirement for receiver confirmation by ACK/NAK responses 
and the retransmission of data to selected receivers 
based on the information associated with the NAK response 
5 are disadvantages, as large numbers of receivers issuing 
ACK/NAK responses can still cause network congestion. 

Another known multicast transport protocol is pro- 
posed in Starburst Multicast File Transfer Protocol 
(MFTP) Specification. Miller, K. et al . , Internet Draft, 
10 April 1998. This protocol operates in the Application 
Layer . 

In copending U.S. Application No. 09/138,994, filed 
Aug. 24, 1998, of common assignee herewith, and hereby 
incorporated by reference, a technique of IP multicasting 

15 over existing broadband networks without using a return 
link is disclosed. This technique allows the issues of 
multicast group membership and error detection and recov- 
ery to be handled locally within an end-user terminal, 
without need for returning data to a host. According to 

20 the technique a single data transmitter sends a group of 
data items to a subset of possible receivers over a 
one-way channel. Each data item is divided into blocks 
which are encapsulated to form datagrams, each including 
a block sequence number, a data item identifier, and a 

25 timestamp indicating the age of the data item. A group 
directory is regularly sent by the transmitter to each of 
the possible receivers. The group directory contains in- 
formation for all groups of data items, enabling each re- 
ceiver to select the group of data item it wishes to re- 
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ceive. Reliability is provided by periodic retransmission 
of missing data. Despite these advantages, significant 
problems remain. 
SUMMARY OF THE INVENTION 

It is a primary advantage of some aspects of the pre- 
sent invention that there is improved caching of content 
that is multicast across a data network. 

A caching arrangement for the content of multicast 
transmission across a data network utilizes a first 
cache, which receives content from one or more content 
providers. Using the REMADE protocol, the first cache 
constructs a group directory. The first cache forms the 
root of a multilevel hierarchical tree. In accordance 
with configuration parameters, the first cache transmits 
the group directory to a plurality of subsidiary caches. 
The subsidiary caches may reorganize the group directory, 
and relay it to a lower level of subsidiary caches. The 
process is recursive, until a multicast group of end-user 
clients is reached. Requests for content by the end-user 
clients are received by the lowest level cache, and 
should the content not be available, the request is for- 
warded toward the root of the tree until it is found. The 
content is then returned to the requestors. Various lev- 
els of caches retain the group directory and content ac- 
cording to configuration options, which can be adaptive 
to changing conditions such as demand, loading, and the 
like. The behavior of the caches may optionally be modi- 
fied by the policies of the content providers. 



IL9-2000-0052 



38930 



Ver. 38930S2 



5 

It is an advantage of some aspects of the invention 
that content need only be transmitted once to multiple 
receivers . 

The invention provides a method of transmitting data 
5 over a communications network which includes receiving 
content from a content provider, and responsive to the 
content establishing a first group directory in a cache. 
The method includes transmitting the first group direc- 
tory from the cache on a data channel to a subsidiary 
10 cache, establishing a second group directory in the sub- 
sidiary cache, in which the second group directory is de- 
rived from the first group directory, and transmitting 
the second group directory from the subsidiary cache to a 
multicast group of receivers. 
15 According to an aspect of the invention, the first 

group directory is transmitted using the REMADE protocol. 
According to still another aspect of the invention, 
the first group directory is transmitted periodically. 

According to an additional aspect of the invention, 
20 the first group directory is transmitted in response to a 
request from a receiver. 

According to another aspect of the invention, the 
first group directory is transmitted according to a pol- 
icy of the content provider. 
25 According to an aspect of the invention, the second 

group directory is transmitted periodically. 

According to still another aspect of the invention, 
the second group directory is transmitted in response to 
a request from a receiver. 
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According to an aspect of the invention, the second 
group directory is transmitted using a REMADE protocol. 

According to yet another aspect of the invention, the 
second group directory is transmitted according to a pol- 
5 icy of the content provider . 

According to a further aspect of the invention, the 
content provider is a plurality of content providers. 

According to another aspect of the invention, the 
subsidiary cache is a plurality of subsidiary caches. 
10 According to a further aspect of the invention, the 

cache and the subsidiary caches are linked together as a 
hierarchical tree, the cache forming a root of the hier- 
archical tree. 

Still another aspect of the invention includes re- 
15 ceiving a transmission request from a member of the group 
of receivers, wherein the transmission request is respon- 
sive to the second group directory, and responsive to the 
transmission request, transmitting a data item from the 
subsidiary cache to the receiver, 
20 According to still another aspect of the invention, 

the first group directory includes a root directory hier- 
archically linked to a plurality of subdirectories. The 
subdirectories carry a list of data items. A subtree of 
the first group directory is defined by one of the subdi- 
25 rectories and at least one linked subdirectory thereun- 
der . 

According to yet another aspect of the invention, the 
second group directory includes a root directory hierar- 
chically linked to a plurality of subdirectories. The 
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subdirectories carry a list of data items. A subtree of 
the second group directory is defined by one of the sub- 
directories and at least one linked subdirectory thereun- 
der . 

5 The invention provides a computer software product, 

comprising a computer-readable medium in which computer 
program instructions are stored, which instructions, when 
read by at least one computer, causes the computer to 
execute a method of transmitting data over a communica- 

10 tions network. The method includes receiving content in a 
first server from a content provider, and responsive to 
the content establishing a first group directory in a 
cache of the first server. The method further includes 
transmitting the first group directory from the cache on 

15 a data channel to a second server that has a subsidiary 
cache, establishing a second group directory in the sub- 
sidiary cache, wherein the second group directory is de- 
rived from the first group directory, and transmitting 
the second group directory from the subsidiary cache to a 

20 multicast group of receivers. 

The invention provides a system for transmitting data 
over a communications network, which includes a first 
server, having a cache therein, The first server receives 
content from a content provider, and responsive to the 

25 content, establishes a first group directory in its 
cache, and transmits the first group directory to a sec- 
ond server having a subsidiary cache. 

According to an aspect of the invention, the cache 
and the subsidiary caches are linked together as a hier- 
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archical tree, the cache forming a root of the hierarchi- 
cal tree. 

BRIEF DESCRIPTION OF THE DRAWING 

For a better understanding of these and other objects 
5 of the present invention, reference is made to the de- 
tailed description of the invention, by way of example, 
which is to be read in conjunction with the following 
drawings , wherein : 

Fig. 1 is a schematic diagram of a multicasting cache 
10 arrangement; 

Fig. 2 is a block diagram of a group directory; and 

Fig. 3 is a schematic diagram of a multicasting cache 
arrangement in accordance with the invention that employs 
a hierarchical tree structure. 
15 DESCRIPTION OF THE PREFERRED EMBODIMENT 

In the following description, numerous specific de- 
tails are set forth in order to provide a thorough under- 
standing of the present invention. It will be apparent 
however, to one skilled in the art that the present in- 
20 vention may be practiced without these specific details. 
In other instances well known circuits, control logic, 
and the details of computer program instructions for con- 
ventional algorithms and processes have not been shown in 
detail in order not to unnecessarily obscure the present 
25 invention. 

Software programming code, which embodies the present 
invention, is typically stored in permanent storage of 
some type, such as a computer readable medium. In a cli- 
ent/server environment, such software programming code 
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may be stored on the client or a server. The software 
programming code may be embodied on any of a variety of 
known media for use with a data processing system, such 
as a diskette, or hard drive, or CD-ROM . The code may be 
5 distributed on such media, or may be distributed to users 
from the memory or storage of one computer system over a 
network of some type to other computer systems for use by 
users of such other systems. The techniques and methods 
for embodying software program code on physical media 

10 and/or distributing software code via networks are well 
known and will not be further discussed herein. 

Turning now to the drawings, and to Fig. 1 thereof, 
there is shown the high level architecture of a first em- 
bodiment of a multicasting system 10 which can be em- 

15 ployed using the techniques of the present invention. The 
source of the files is a content provider 12, which is 
provided with transmitting capability, but may lack re- 
ceiving capability. The content provider 12 is a parent 
with respect to a downstream cache 14. Content is multi- 

20 cast via a data network 16, which may be the Internet, to 
a plurality of end-user clients 18. 

It is possible for the cache 14 to receive content 
from a plurality of content providers. For example both 
the content provider 12 and the content provider 20 may 

25 submit a catalog to the cache 14, which then combines in- 
formation from the two catalogs to formulate its own 
catalog or group directory for subsequent multicast. 

The cache 14 employs the REMADE protocol to multicast 
the content. The REMADE protocol is disclosed in the 
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above noted U.S. Application No. 09/138, 994. The REMADE 
protocol is a technique of IP multicasting over existing 
broadband networks without using a return link. This 
technique allows the issues of multicast group membership 
5 and error detection and recovery to be handled locally 
within an end-user terminal, without need for returning 
data to a host. According to the technique, a single data 
transmitter sends a group of data items to a subset of 
possible receivers. Each data item is divided into 

10 blocks, which are encapsulated to form datagrams, each 
including a block sequence number, a data item identi- 
fier, and a timestamp indicating the age of the data 
item. A catalog, comprising a group directory is regu- 
larly sent by the transmitter to each of the possible re- 

15 ceivers. The group directory contains information for all 
groups of data items, enabling each receiver to select 
the group of data item it wishes to receive. Reliability 
may be provided by periodic retransmission of missing 
data. 

20 In some embodiments, improvements in the REMADE pro- 

tocol which were disclosed in our copending Application 
No. 09/564,387, filed May 3, 2000, and herein incorpo- 
rated by reference, may be used in the practice of the 
present invention . 

25 Referring now to Fig. 2, a representative group di- 

rectory, directory system 30, is shown. The directory 
system 30 is a preferably a hierarchical arrangement, and 
comprises a root directory 32, with links to subdirecto- 
ries 34. The root directory 32, and any of the subdirec- 
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tories 34 may have links to general data items 36, or to 
patches 38. The arrangement is recursive, with subdirec- 
tories 34 having links to more deeply nested subdirecto- 
ries, such as subdirectory 40. 
5 Referring again to Fig. 1, functionally the cache 14 

operates according to a service policy that is appropri- 
ate for a particular application. Files may be retained 
or erased, in whole or in part, following a multicast, 
depending on various factors, which may include file 

10 size, the characteristics of the provider or the recipi- 
ent, average demand for the content, currently available 
free space, and many other relevant parameters. 

As shown in Fig. 3, a second embodiment of the inven- 
tion employs a multilevel hierarchy 22 of cache servers. 

15 As in the first embodiment, a plurality of content pro- 
viders 12, 20 provide content to a high level cache 24, 
The content providers 12, 20 have transmitting capabil- 
ity, but may lack receiving capability- In addition to 
providing content, the content providers 12, 20 option- 

20 ally act as policy control servers, specifying the re- 
cipients to whom the content may be delivered, and the 
manner of delivery. In default of a content delivery pol- 
icy received from either of the content providers 12, 20, 
the cache 24 executes an internal policy. In any case the 

25 interaction of the cache 24 and other levels of the mul- 
tilevel hierarchy 22 is preferably determined by internal 
configuration options, rather than by the content provid- 
ers 12, 20. 
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Upon request, or in accordance with its policy con- 
trol, the cache 24 delivers catalog and content in accor- 
dance with the REMADE protocol over a data network, which 
may be the Internet, to subsidiary caches 26. The sub- 
sidiary caches 2 6 have both receiving and transmitting 
capabilities, and depending upon the attributes of their 
respective clients, may independently reorganize the 
catalog that was received from the cache 24. In Fig. 3, 
each of the subsidiary caches 26 has end-user clients 28. 
It should be noted that while the multilevel hierarchy 22 
is represented for clarity in Fig. 3 as a two-level tree, 
comprising the cache 24 and the subsidiary caches 26, 
there can be any number of levels in the tree-structured 
hierarchy, as appropriate for a particular application. 
In such case, the subsidiary caches 2 6 communicate with a 
lower level of caches, rather than directly with the 
end-user clients 28, 

Typically, the end-user clients 28 have both trans- 
mitting and receiving capability. When the subsidiary 
caches 26 have organized their data inventory into a 
tree-structured catalog according to the REMADE protocol, 
as disclosed more fully in the above noted Application 
No. 09/138,994, they transmit it to all the end-user cli- 
ents 28, or to a predefined multicast group of the 
end-user clients 28. The end-user clients 28 receive a 
relevant part of the catalog, or may receive the whole 
catalog. They choose a document, and begin receiving it 
according to the REMADE protocol. Of course, various mem- 
bers end-user clients 28 may choose different documents, 
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in which case all the documents are transmitted according 
to the governing policy. Clients not currently members of 
the multicast group may in some circumstances elect to 
join it. Based on considerations such as the average num- 
ber of requests for particular content specified in the 
catalog, the subsidiary caches 2 6 can independently de- 
cide to elect a periodic mode of transmission of the 
catalog or the content, or to transmit either or both of 
them on demand. In like manner, the cache 24 can elect a 
mode of transmission of its catalog to the subsidiary 
caches 26. 

In the multilevel hierarchy 22, should a particular 
one of the subsidiary caches 26 lack a file, or portion 
of a file, requested by a receiver, such as one or more 
of the end-user clients 28, it obtains the missing parts 
from the cache 24. These parts are immediately resent to 
the receiver, which considerably reduces latency from the 
point of view of the receiver. 

In a downstream push mode of operation, if a particu- 
lar content is designated according to the service policy 
as content in high demand, then the cache 14 may, even in 
the absence of a request from any of the end-user cli- 
ents 18, multicast the content. In this mode, the content 
is flagged, requiring any downstream caches to begin re- 
ceiving the content immediately, without waiting for 
transmission requests from clients. In the case of a mul- 
tilevel cache hierarchy, the caches in each level may 
push the content down to other levels. As in the other 
modes of operation, the behavior of the caches at all 
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levels is controlled by configuration parameters, option- 
ally modified by the policies of higher level caches, or 
of the content providers 12, 20. 

While this invention has been explained with refer- 
ence to the structure disclosed herein, it is not con- 
fined to the details set forth and this application is 
intended to cover any modifications and changes as may 
come within the scope of the following claims: 
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